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O ! ABSTRACT 

^ ' We develop a new empirical methodology to study the relation between the stellar mass of 

^ \ galaxies and the mass of their host subhaloes. Our approach is similar to abundance matching, 

and is based on assigning a stellar mass to each subhalo within a large cosmological A^- 
body simulation. However, we significantly extend the method to include a different treatment 
, for central and satellite galaxies, allowing the stellar mass of satellite galaxies to depend on 

■ both the host subhalo mass, and on the halo mass. Unlike in previous studies, our models 
\ are constrained by two observations: the stellar mass function of galaxies, and their auto- 

■ correlation functions (CFs). We present results for ~ 10^ different successful models, showing 



that the uncertainty in the host subhalo mass reaches a factor of ~10 for a given stellar mass. 
Our results thus indicate that the host subhalo mass of central and satellite galaxies is poorly 
CO . constrained by using information coming solely from the abundance and CFs of galaxies. 

' In addition, we demonstrate that the fraction of stellar mass relative to the universal baryon 

fraction can vary between different models, and can reach ^ 0.6 for a specific population 
of low mass galaxies. We conclude that additional observational constraints are needed, in 
order to better constrain the mass relation between haloes and galaxies. These might include 
weak lensing, satellite kinematics, or measures of clustering other than the CFs. Moreover, 
CFs based on larger surveys with a better estimate of the systematic uncertainties are needed. 
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1 INTRODUCTION 

The relation between the stellar mass of galaxies and their host 
dark-matter haloes has become a key point of reference for many 
different theoretical and observational studies. It summarizes in 
a simple way the complexity of galaxy formation physics when 
evolved within growing dark-matter structure. A special attention 
was given to the relation between galaxies and their host subhaloes, 
which are the sub-structure bound density peaks inside a halo. 

The 'abundance matching' (hereafter ABM) methodology is 
an important theoretical tool for constraini ng the mass relation 



between galaxies and t heir host subhaloes jKravtsov et al.l 
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IGuo et al.ll2oTol : iMoster et al.ll2010l : iBehroozi et al]|2010h . In this 
simple empirical approach one assigns a model galaxy to each sub- 
halo within a cosmological A^-body simulation. Assuming there is 
a one-to-one, monotonic relation between the stellar mass (m*) and 
the subhalo mass (Minfau, see section lZTT l, the abundance of galax- 
ies and subhaloes can be matched, yielding a unique relation be- 
tween m* and Afinfaii. Surprisingly, this simple model provides a 
good fit to the observed clustering properties of galaxies. 

The ABM approach thus offers a practical solution to the rela- 
tion between subhaloes and galaxies, without going into the com- 
plex details of galaxy formation physics. It can be used to con- 
strain the mass relation at various redshifts, to predic t the star- 
formation rate in galaxies (IConrov & Wechslej |2009|) , to study 
the merger-rates of galaxies jHopkins et al.ll20IO|). and to interpre t 
high-resolution hydrodynamical simulations ( Sawala et alj|201lh . 
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In comparison to models ba s ed on the halo occupation distribution 
teeriind & Weinberg! I2OO2I : ICoorav & Sliettil I2OO2I: [Tinker et al.l 
I2OO5I : IZehavi et al.ll200l and references therein), ABM uses ex- 
plicit information on the location and mass of subhaloes, decreas- 
ing the number of free parameters needed in the model. 

The success of ABM is intriguing and raises several interest- 
ing questions: Is it based on the only possible set of assumptions 
that can reproduce the abundance and clustering of galaxies? Do we 
miss models that result in a different — Afinfaii relation? What 
are the important assumptions made by ABM? How can we ex- 
plore these assumptions and test to what level they are constrained 
by observations? In this work we try to address these questions. We 
specifically focus our results on the freedom in the — Minfau 
relation. 

In ABM, the stellar mass is assigned to each subhalo accord- 
ing to Afinfaii- For a satellite subhalcQ. A/infaii is defined as the 
mass at the last time it was the most massive substructure within 
its FOF group. This is a reasonable assumption because the sub- 
halo mass of satellite galaxies can be signi ficantly stripped af- 
ter falling into a larger da rk-matter halo (e.g. IZentner et alj|200^ ; 
Ivan den Bosch et al J I2OO5I) , more so than the stellar mass of its 



galaxy (for stellar stripp i ng see Monaco et al. 2006; Pu rcell et al.l 
l2007lJconrov et al.l200'7l : lYang et alj2009bl) . On the ot her hand, for 
a centr al subhalo, Afinfaii is defined as its current mass. I Wang et al.l 
( I2OO6I) found that the relation between Minfau and the stellar mass 
of galaxies is tight in semi-analytic models, justifying the above 
definition of MintaU. 

There are various other assumptions made by ABM, which 
are mainly related to the treatm ent of satellite galaxies. In a re- 
cent paper jNeisteinetallbOlOl hereafter paper I), we have exam- 
ined these assumptions usin g the semi-analytic model (SAM) of 
iNeistein & WeinmannI ( I2OI0I) . Although ABM models assume that 
for a given Minfau, the stellar mass of central and satellite galax- 
ies is the same, there are various effects within SAMs that vio- 
late this assumption. First, the relation between stellar mass and 
subhalo mass evolves with redshift for central galaxies, affecting 
satellite galax ies at the time of infall (the typical infall time is a 
few Gyrs ago: IWang et al.ll2007l) . Second, the stellar mass of satel- 
lite galaxies might be different already at the time of infall from 
that of central galaxies at the same time. This is because galaxies 
that join larger groups as satellites are located in large-scale en- 
vironment of higher density than the average. Consequently, the 
dark-matter merger-histories of these galaxies are already differ - 
ent at early epochs (see also lGao et al.ll200^ : iHarker et al.ll2006l ). 
Third, once a galaxy becomes a satellite, its stellar mass might 
still grow. This is especially true for models in which gas strip- 
ping i n satellite galaxies is modeled on time-scales of a few 
Gyrs JWeinmann et al.l boiot) . All the effects above are consis- 
tent with vari ous studies about the properties of central and satel- 
lite galaxies IWeinmann et alj l2006l: Ivon der Linden et al.l |2007|; 



Khochfar & OstrikeJ I2OO8I : ISkibba & ShethI l2009l : IPasquali et ^ 
2009l : ISkibbaetal.ll201ll) . 



In paper I, we showed that the effects above can change not 
only the m* — Afintaii relation for satellite galaxies, but also the 
auto-cortelation function (CF) of galaxies. If satellite and central 
galaxies with a given Afinfaii are randomly redistributed in a SAM, 
the CFs can vary by up to a factor of four. Even when just redis- 
tributing satellite and central galaxies among themselves, the mod- 



ifications in the CFs can reach a factor of 2 for massive galaxies. 
We have shown in paper I that the CFs of SAM galaxies can be 
reproduced accurately by the ABM approach only when the stellar 
mass of satellite galaxies is assumed to depend on both A/infaii and 
the host halo mass at z = 0. This finding is very useful as it saves 
us the complex modeling of m* as a function of the various effects 
mentioned above. 

In this paper we make use of the conclusions made in paper 
I, and add more ingredients to an ABM model. We would like to 
study the influence of the new ingredients on the relation between 
Afinfaii and m*, while the models are constrained to fit the observa- 
tional data. We allow the stellar mass of satellite galaxies to depend 
on both the A/infau and the host halo mass, and to deviate from 
the behaviour of central galaxies. In addition, we use two ingredi- 
ents for modeling the location and abundance of satellite galaxies: 
Subhaloes that were lost by the cosmological simulation at high 
redshift, but can still host galaxies at z = 0, are identified using 
an estimate for dynamical friction. The location of unresolved sub- 
haloes is fixed using either the location of the most bound particle 
of the last identified subhalo, or using an analytical model with a 
free parametrization. 

Once the population of galaxies is broken into two sub- 
populations, the model cannot be constrained by the abundance of 
galaxies alone. The number of models that can fit the observed stel- 
lar mass function of galaxies is infinite. We therefore check a very 
large number of models (~ 10^^) and look for those that fit both the 
observed stellar mass function, and the observed auto-correlation 
function of galaxies. We will show that even when using these two 
constraints, there is a significant amount of freedom in the relation 
between and A/finfaii. Satellite galaxies might occupy a signifi- 
cantly lower value of A/infaii and thus might be more abundant than 
central galaxies at a fixed rrii,. The dependence of satellite galaxies 
on the halo mass is able to compensate for this effect in terms of 
clustering, and is crucial for fitting the observed auto-correlation 
functions. 

The observational constraints used in th is work include th e 
stellar mass function of galaxies, as derived bv lLi&Whit3(l2009l) . 
For the CFs o f galaxies we use the methodology p resented by 
iLi et alj l l2006h . using the same stellar masses as in iLi & White! 
( l2009n . These stellar masses are based on red shift and five-band 
photom etric data, as described in detail by iBlanton & RoweisI 
(12007). The galaxy sa mple is based on the final data release (DR7; 
Abazaiian et'ai]|2009l) of the Sloan Digital Sky Survey (SDSS; 
York et al. l l2000l)T Although there might be both random and sys- 
tematic deviations in the stellar masses used here, the two observa- 
tional constraints are self-consistent. 

This paper is organized as follows. Section |2|describes in de- 
tail the approach we use in this work. We elaborate on the new 
ingredients used here, and the way we implement the models. In 
section[3]we demonstrate the results of our formalism by showing 
a few different models that fit the data well. Our method for scan- 
ning the parameter space is discussed in section |4] where we ex- 
plain how the parameter space is sampled, and how we select good 
models. In this section we also show the properties of the success- 
ful models. Lastly, we summarize our results and discuss them in 
sectionjS] 



2 THE FORMALISM 



We define 'satellite subhaloes' as all the substructure clumps within a 
FOF group, except the most massive one. 



In this section we describe the formalism developed for modeling 
the abundance and clustering of galaxies. In general, we assume 
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that the stellar mass of galaxies depends solely on the properties of 
the host dark-matter haloes. This assumption is similar to the abun- 
dance matching approach (e.g. IVale & Ostrikerll2004 . However, 
we significantly extend the ingredients of the model with respect 
to previous studies. This is motivated by t he analysis done in pa- 

S ier I, based on the semi-analytic models of iNeistein & WeinmannI 
The additional ingredients are related to the assumptions re- 
garding satellite galaxies: their abundance, location, and the effect 
of the host halo mass on their stellar mass. 

We start by sunmiarizing the various ingredients of the for- 
malism, a detailed description of each component is given in the 
following sub-sections. 

(i) All the subhaloes from a cosmological A'^-body simulation at 
z = Q are selected. 

(ii) In addition, we use subhaloes that have been merged into a 
bigger structure at high redshift, but might host galaxies that sur- 
vive until z = Q- This is due to the effects of dynamical friction 
and stripping on galaxies, which are not included in the A*'-body 
simulation. 

(iii) We use a freely tunable prefactor in the dynamical friction 
formula. 

(iv) We assign one galaxy with a specific stellar mass (m*) to 
each subhalo from the full sample defined in items (i) & (ii). 

(v) For central subhaloes within their FOF group, we assume that 
the stellar mass of galaxies depends on the host subhalo mass at 
z = only (Minfaii). 

(vi) For satellite subhaloes, we assume that the stellar mass of 
galaxies depends on two parameters: the host subhalo mass at the 
time of infall, and the host halo mass at z = 0. 

(vii) We allow the dependence of stellar mass on the subhalo 
mass to be different for central and satellite subhaloes. 

(viii) The location of subhaloes that are not found in the sim- 
ulation at z = is set by either the location of their most bound 
particle, or by an analytical model. 

Items (iii), (vi), (vii), (viii) in the list are new with respect to previ- 
ous ABM models, and their influence on clustering and abundance 
of galaxies has not been tested before. 



2.1 Subhaloes and merger-trees 

We use merger trees extracted from the Millennium A'^- 
body simulation dSpringel et al.l l2005h . This simulation was 
run using the cosmological parameters (rim, i^A, h, erg) = 
(0.25, 0.75, 0.73, 0.9), with a particle mass of 8.6 x 10* h'^MQ 
and a box size of 500 /i~^Mpc. The merger trees are based on 
snapshots spaced by ^250 Myr, linking su bhaloes identified by the 
SUBFIND algorithm jSpringel et al.ll200lh . Su bhaloes are define d 
as the bound density peaks inside FOF groups ( jPavis et"al]|l985h . 
More de tails on the simulation and the subhalo merger -trees can be 
found in lSpringel eTal] | |2005|) and lCroton et al.l(l2006h . 

The subhalo mass, M^, corresponds to the number of particles 
inside a subhalo, as identified by SUBFIND. The infall mass of the 
subhalo, Minfaii, is defined as 



if central within its FOF group 



Here Zinfaii is the lowest redshift at which the main progenitoi0 of 
the subhalo Mh was the most massive within its FOF group, and 
M/i_p is the main progenitor mass at this redshift. 

In addition to A/infaii we will use the halo mass i\f200, which 
is defined as the mass within the radius where the halo has an over- 
density 200 times the critical density of the simulation. In general, 
the mass M200 should include both the central subhalo within a 
group, and all of its satellite subhaloes. However, due to the spher- 
ical symmetry forced on i\/200 and the specific over density being 
used here, it might deviate from the exact FOF group mass. In what 
follows, we will refer to K'hm as the 'halo mass' to indicate that 
this mass is computed over a larger spatial region than the subhalo 
mass. For satellite subhaloes, we assign the same value of Af200 as 
is computed for their central subhalo within the same group. Due 
to the effects above, the value of Afinfaii is often similar or higher 
than A/200 for central subhaloefl 



2.2 Satellite subhaloes: dynamical friction and location 

Satellite subhaloes are defined as all subhaloes inside a FOF group 
except the most massive (central) subhalo. Within the A'^-body sim- 
ulation, satellite subhaloes lose their mass while falling into a big- 
ger subhalo, and thus might fall below the resolution limit used 
in the SUBFIND algorithm (20 particles for the Millennium simula- 
tion used here). However, the galaxies residing inside these satellite 
subhaloes might live longer, as they are more dense, and thus less 
vulnerable to stripping. This effect is significant even for relatively 
high resolution cosmological simulations, as was shown in paper 
I. It can modify the abundance of subhaloes even at two orders of 
magnitude above the minimum subhalo mass resolved by the sim- 
ulation. 

In order to take this effect into account, we model the time it 
takes the galaxy to fall into the central galaxy by dynamical friction. 
At the last time the satellite subhalo is resolved we compute its 
distance from the central subhalo (rsat), and estimate the dyna mical 
friction time using the formula of lBinnev & TremainddigSTh . 



tdf = Odf ■ 



1.17K,rL 



GMh.2^n{l + Mh.i/Mh,2) 



(2) 



Here Mh,i is the mass of the central subhalo, Vv is its virial veloc- 
ity, and Mh,2 is the mass of the satellite subhalo. Once a satellite 
subhalo falls together with its central subhalo into a larger group, 
we update tdf for both objects according to the new central subhalo. 

The dynamical friction estimate is computed using the mass 
of subhaloes only, before galaxies are being modeled. There- 
fore, the formula deviates from its proper definition, as it does 
not include the effect of the galaxy mass on dynamical fric- 
tion. However, since this formula uses various simplified assump- 
tions with a larger uncertainty (for example, the exact trajectories 
of th e satellite subhaloes are ignored, see iBovlan-Kolchin et al.l 
l2008i) . one general constant, a^f, is being used here to ab- 
sorb all the related uncertainties. Semi-analy t ic models often 
use Odf ^ 2 - 3 JPe Lucia & Blaizo^ 120071 : iKhochfar & silS 
I2OO9I : INeistein & Weinmanrj|2010l) . in agreem ent with more de- 



A^infall = 



(1) 



A/h,p(zinfaii) otherwise 



tailed studi es of dynamical friction p rocesses (Colpietal, 
rBovlan-Kolchin et alj l2008l : Ijiang et aL .200a : ,Mo et ah 



^ Main-progenitor histories are derived by following back in time the most 
massive progenitor in each merger event. 

For central subhaloes, Af200 is slightly smaller than A/infaii- The aver- 
age difference is ~ 0.08 dex, with an RMS scatter of ~ 0.06 dex 
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Log M200 

Figure 1. The abundance of satellite subhaloes, t/is , as a function of Mj^f^ii 
and M200 . using the full Millennium simulation with a^f = 3. Contours 
show constant values of (p^ as labeled in units of Log Mpc~'^ dex~^. 0s 
is bigger than zero only below the dotted-dashed hne. The thin dashed line 
corresponds to A/infall = M200 and is shown as a reference. 



Since we ignore contribution due to the galaxy mass, we allow 
more freedom in Odf as is discussed below. 

To summarize, three types of subhaloes exist in our models: 

• central subhaloes: most massive subhaloes within their 
FOF group. 

• satellite subhaloes: all subhaloes except central subhaloes. 

• unresolved subhaloes: subhaloes that were last identified at 
high z, and are added according to the dynamical friction formula 
above. All the unresolved subhaloes are also satellite subhaloes. 

The location of unresolved subhaloes is not given by the simu- 
lation. We therefore need to estimate the location of the galaxies 
that are assigned to these subhaloes. This issue was treated by e.g. 
ICroton et alj ( l2006h who used the location of the most bound par- 
ticle of the last identified subhalo, as is given by the dark-matter 
simulation at z = 0. The same location is adopted here as well. 
However, we would like to take into account the uncertainty in- 
volved in this method. Obviously, one dark-matter particle inside 
a collisionless A'^-body simulation cannot mimic accurately the lo- 
cation of an extended galaxy, with a possibly very different mass. 
We therefore use the following analytical model as an additional 
option: 



= rs.t (1 - r") 



1/9 



(3) 



Here r is the fraction of time spent out of all the estimated dynam- 
ical friction time until z = 0, r is the distance we adopt at z = 
from the central subhalo, and p, q are constants. In appendix IaI we 
derive this model by following the angular momentum of a parti- 
cle inside a spherical gravitational potential of the type p oc r^^. 
We show that for a specific values of p and q, the model is able 
to reproduce the location of subhaloes as is modeled by the most 
bound particle. This parametrization enables us to check various 
modifications to the location of unresolved subhaloes. 



2.3 Counting subhaloes 

The usual way to implement ABM is to first populate subhaloes 
with galaxies, and only then to compute the auto correlation func- 
tions (CFs) of galaxies. In our approach there are many possible 
models, mainly depending on the different mass relations for satel- 
lite and central galaxies. Since computing the statistics of pairs is 
the most demanding computational step, it is not possible to scan 
a significant number of models in the usual technique. Here we 
present a new way of computing the CFs which is extremely effi- 
cient when many models are needed. We first compute the statis- 
tics of subhalo pairs, and save them as a function of Minfaii and 
A/200- Only then are the galaxy CFs computed. In this section we 
explain how the statistics of subhaloes is defined and computed, the 
next subsection discusses the way this information is used to model 
galaxies. 

We want to constrain our models against the stellar mass func- 
tion of galaxies. In order to do so we will use the mass function of 
central and satellite subhaloes: 



V dlog A'/infall 
1 



, Af 200 ) = 



(4) 



(5) 



V dlog Afinfaii dlog A//200 

Here V is the volume of the simulation box, and A^*^, A'^' are the 
numbers of central and satellite subhaloes respectively. In Fig. [T] 
we show the two dimensional mass function 0s , for a model with 
Qdf = 3. A similar behaviour as presented here is valid for 0.1 ^ 
Qdf ^ 10, where the low-mass contour can vary by ~ 0.5 dex, and 
the contour of massive subhaloes is hardly affected. 

In order to compute the CFs of galaxies, we start by computing 
the number of subhalo pairs, Np, for the total sample of subhaloes. 
In case both subhaloes within the pair are central subhaloes, we 
count this pair into tpcc' 

.-ts 

V'cc(A/iifall, A/fnfall,r- 



1/2 dlog A/; 



nfalldlogA/2j^,l dlogr 



(6) 



nfaii' Afj'^fjjii are the infall mass of the first and second 



Here A4 

subhaloes in the pair, and r is the distance between these subhaloes 
within the x-y plane. Distances are computed only within the x-y 
plane in order to compute the projected auto-correlation function, 
as is described below. In practice we divide the range in LogAfinfaii 
and Log(r) into 100 and 50 bins respectively, and save t/jcc as a 
multi-dimensional histogram. 

In a similar way we define the pair statistics of central- 
satellite, and satellite-satellite subhaloes. 



■i/'ss ( Afi^nfall , Afhifall , Af200 , Af|oO , : 
^cs(Afinfall, Afj^ifall, M200 , r) . 



(7) 
(8) 



Note that for satellite subhaloes the number of pairs is saved as a 
function of both Afinfaii and Af2oo ■ This is done in order to prop- 
erly model the dependence of stellar mass on Af200 • It should be 
emphasized that the statistics of satellite subhaloes, i.e. (jjs, ipsa, 
t/jcs, depend on the dynamical friction constant, Qdf, and on the 
location we adopt for unresolved subhaloes (p and q from Eq.|3}. 



2.4 Definition of a model - domains in stellar mass 

Previous studies have often used an analytic functional form 
to describe the rel ation between Afinfaii and m*. For example, 
iMoster et al] ( 1201 Ol) suggested: 
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Figure 2. The dependence of on (Mi„fjjii,M2oo) for satellite galaxies 
according to model A. This model uses ajf = 3,p = 0.5, g = 0.8. The 
solid lines show IJf that obey the equation / (r//(M2oo)i ^20o) = 
where m*,i = 9.27, 9.77, 10.27, 10.77, 11.27, 1 1 .77 in units of log Mq . 
Diamonds are placed at the median A/i,ifaii values along each line. The 
dotted-dashed line and the thin dashed line are the same as in Fig.[T] 



/ (Minfall) = ClAfinfall [c2Mi^f^ 



(9) 



where Ci are all constants. A straigtit forward way to extend this 
approach here would be to parameterize / as a function of two 
variables, m* = /(Minfaii, M200), and to use a different set of 
parameters for modeling central and satellite galaxies. However, 
this approach requires a priori knowledge of /, and the resulting 
solutions might be restricted by the specitic functional form chosen. 
This is especially true when / is allowed to be different for satellite 
and central galaxies, so the freedom in its functional shape might 
be larger than what was found in previous studies (see the examples 
in section|3]below). 

Here we suggest a new way to parameterize the relation be- 
tween Minfall and m*, which is motivated by the observational 
data. The observed CFs of galaxies are computed over mass bins 
of width 0.5 dex in stellar mass (hereafter 'domains'). For exam- 
ple, the CP for small mass galaxies is based on galaxies with stellar 
mass in the range 



Domain 1: [10" 



10" 



(10) 



In order to model the CP in this range we only need to know the 
values of A/infaii , A/200 that coTOspond to m* = 10^'^^, 10^'^^. 
These are defined by: 



/ (Min 



,Af2 



10" 



Afin 



,Af2( 



lO^ '^'^ . (11) 



Such constraints imply that in case of a smooth and monotonic 
/, the 'boundaries' of the domain coixespond to curves within the 
Minfall, M200 plane. The relation that defines a boundary can thus 
be written as: 



Afinfall = Ul (Af200) , A/in 



U2 (A/200) 



(12) 



where Ui is the curve function. All subhaloes that are located be- 
tween Ul & U2 (their A/infaii, A/200 masses follow the constraint 
(7i(A/2oo) ^ Minfall ^ f/2(M2oo)) should Contribute galaxies to 
the CP of the first domain. 

The standard abundance matching approach assumes that m* 
depends only on A/infaii. In terms of our language, this means that 



Figure 3. The relation between m^, and Minfaii in model A. For satel- 
lite subhaloes we plot various values of Afinfaii per a given m * , including 
median (thick dashed lines), one standard deviation from the median (thin 
dashed lines), and the full range (thin dotted-dashed lines). The thin sohd 
line corresponds to the universal baryonic fraction, m* = 0.17A/infaii. 
Model A violates this fraction limit, and thus will not be used in the rest of 
the paper. 



the functions Ui's are all constants, with no dependence on M2oo. 
In this work, we extend this assumption in two ways: we assume 
that Ui might be a power-law for satellite galaxieQ and that for 
each domain there might be a different Ui for central and satellite 
galaxies. This can be summarized as: 



Central subhaloes: Ui 
Satellite subhaloes: Ui 



(13) 



Since the observed CPs are based on 5 domains in stellar mass, 
we need to specify C/^'s at 6 domain boundaries. Our model there- 
fore includes 6 free parameters (oi) for central subhaloes, and ad- 
ditional 12 parameters (/3i, Si) for satellite galaxies. We choose to 
limit our models only to power-law dependence on A/2()o follow- 
ing the results of paper I, and as a first order approximation. It will 
allow us to test how important the standard assumption of constant 
Ui is. It might well be that a more complex behaviour would add a 
significant amount of freedom to the models. 

In order to model the stellar mass function, we will use the 
same domains, and demand that the stellar mass function will 
be reproduced once integrated over each domain. This guarantees 
that in case it is needed, a detailed solution of the type m* — 
/(A/infaii, A/200) exist. However, since our models allow the mass 
relation above to deviate between central and satellite galaxies, 
the detailed behaviour of m* within each domain is not well con- 
strained. In general, there might be many different interpolations 
of the kind m* = /(A/infaii, A/200) within each domain (between 
adjacent Ui's). These will not change the computed CP, and will 
fit the observed stellar mass functions. The range in stellar mass 
for each domain is relatively small, so this freedom is negligible in 
comparison to the results we will show below. 



^ For central galaxies the values of A/200 are always very similar to 
A/infall so there is no added value in allowing the stellar mass of central 



galaxies to depend on A/200 . 
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In general, modeling a scatter in m* for a given Minfaii and 
M200 is possible within our formalism. However, it demands a de- 
tailed knowledge of the functional form, m* = /(Afinfaii, Af2oo). 
This addition does not allow us to scan the different models in a 
very efficient way. We therefore do not treat such a scatter in this 
work. Nonetheless, the dependence of m* on M200 for satellite 
galaxies results in a variation in m, as a function of Minfaii. This 
effect will be discussed below. 



2.5 How to compute CFs? 

To summarize, each model in this work is defined by the following 
parameters: 

a) The value of Qdf used in the dynamical friction formula, as 
defined by Eq.[2] 

b) The values p, q used in Eq.[3]for modeling the location of un- 
resolved subhaloes. Alternatively, we use the location as given by 
the most bound particle. 

c) For central subhaloes, 6 values of cti that define the domain 
boundaries C/f (Eq|13l>. 

d) For satellite subhaloes, 6 values of Pi, and 6 values of Si that 
correspond to the Ui boundaries. 

Once the parameters aaf, p, q are chosen, we construct the 
subhalo statistical functions (j) and 7/;. The parameters in items (c) 
and (d) above are then used as integration limits for (j) and i/i. By 
integrating 4> we compute the total number of subhaloes within the 
domain: 



N = V 



V 



tic dlog MinfaU + 



, dlog A/infaU dlog M200 . 



(14) 



A similar integration of i/; over each domain results in Np{rp) - 
the total number of pairs within each radial bin rp. The projected 
auto-correlation function, Wp{rp), is then defined as the deviation 
in the number of pairs from the average value per volume: 



Up{rp) = 



Np{rp) 



An 



(15) 



Here Ap is the 2-dimensional area covered by the radial bin rp, and 
L is the size of the simulation box in h^^Mpc. 

We have tested numerically that our methodology agrees with 
the standard ABM approach for various different models. In our 
approach the models are based on quantifying (p and -0 only, so we 
do not construct a full realization of galaxies per each model. This 
is in contrast to standard ABM models, where a list of all subhaloes 
within a given simulation is necessary. 



3 A FEW EXAMPLES 

In this section we present a few specific models that were found 
using a large parameter search within our formalism. The details of 
the search will be presented in the next section. Here we discuss a 
few examples to demonstrate the parametrization needed for each 
model, and to highlight the variety of models that are able to fit 
both the stellar mass function and CFs. 

In Figs.[2]&[3]we present model A, one example out of a fam- 
ily of models which are extreme with respect to the full population 
of models. In this model, m* is much higher for satellite subhaloes 
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Figure 4. Same as Figs. |2]&ll]but for model B. Here = 3, and the 
location of um'esolved subhaloes is set by the most bound particle. 



than for central subhaloes, for a given Afinfaii. The difference can 
reach a factor of 100 in stellar mass. As can be seen from Fig. |2] 
this model includes a relatively strong dependence of on M200, 
a feature that was originally seen in the SAMs analyzed in paper I, 
although less prominent. We note that previous ABM models were 
using only horizontal lines in the M200 — Afinfaii plane. 

The CFs of model A are plotted in Fig.[TO] showing a reason- 
able match to the observed data, with an RMS deviation of 0.2 dex 
(more details on how we define this deviation can be found in sec- 
tion |4.1| l. Interestingly, the same model fits the stellar mass function 
well (see Fig.|7]l. These results suggest that even though model A is 
extreme, it is broadly consistent with the observational constraints 
adopted here. However, Fig. [3] shows that this model violates the 
limit m* < O.lTAfinfaii for satellite subhaloes. Assuming all 
baryons within each subhalo are converted into stars, the maximum 
stellar mass should equal m* — Qb^^^ Mi^un = O.lTA^infaii- 
Model A exceed this limit, and is therefore rejected and will not be 
considered as a valid model in what follows. It is plotted here to 
demonstrate that using our formalism, fitting both the stellar mass 
function and the CFs is not enough for constraining the maximum 
stellar mass per a given Afinfaii. 

Models B and C are plotted in Figs.|4]&|5] and show a very 
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Figure 10. The projected auto-correlation functions (CFs) derived for models A,B,C, D. Each panel corresponds to galaxies with stella r masses as indi cated 
by the range of LogA/Q. Lines show the results of the models. The observational data are using SDSS DRV with the same technique as in iLietalJiioO^ . and 
are shown as error bars. Models B, C, D fit the observational data to a level of 0.1 dex RMS, while model A deviates at the level of 0.2 dex (details on the 
way these errors are computed can be found in section |4~TI . 



different behaviour for low mass subhaloes. The difference for a 
given m* between these two models can reach a factor of 10 in 
-Minfaii for central and satellite subhaloes. Model C shows a steep 
dependence of m, on Minfaii for low mass central subhaloes. This 
means that for a given subhalo mass, the difference between m* 
for satellite and central galaxies might be very large, more than a 
factor of 10. 

We emphasize that our definition of a 'satellite' versus 'cen- 
tral' is valid only for subhaloes. It might be that satellite subhaloes 
will host more massive galaxies than the central subhalo within the 
same group. In these cases the more massive galaxy in the group 
might be identified as a 'central' galaxy, although its host sub- 
halo is defined as a 'satellite' here. In a recent studv. Iskibba et all 
( |20T l|), have estimated the fraction of haloes that host central galax- 
ies that are not the m ost luminous in th eir group. Using a group cat- 
alog based on SDSS jYang et al.l2007h they found that this fraction 
reaches 25 (40) per cent for low (high) mass haloes. Thus, a differ- 
ent identification of 'central' and 'satellite' may be used either in 
observational studies, or in the analysis of hydrodynamical simu- 
lations. In our models, the difference between satellite and central 
subhaloes is defined solely according to the dark-matter behaviour, 
and is motivated by the different merger history of subhaloes, and 
their final location. 

From Fig. |5] it is evident that model C predicts a non- 
negligible number of low mass subhaloes that host galaxies with 
very high stellar masses, including galaxies with ~ 0.6 • 



0.17A'finfaii- This shows again that our formalism cannot constrain 
the mass fraction that is locked in stars, in contrast t o what has been 
claimed using more s implified ABM models (e.g. iBehroozi et al] 
l20ld : lGuoetalj201(]|) . 

Model C is a good example of a model with a very high abun- 
dance of satellite galaxies. Although the number of satellite sub- 
haloes is smaller than central subhaloes at a given Minfaii, differ- 
ent m, — Mintaii relations for central and satellite subhaloes might 
result in a high abundance of satellite galaxies. We plot the satellite 
fraction (the number of satellite galaxies out of all the galaxies of a 
given m-t:) of all the models in Fig.[8] The satellite fraction changes 
significantly between the models. For model C this fraction reaches 
unity at low stellar masses, while for models B and D it reaches a 
maximum value of ~ 0.3 and 0.4 respectively. 

Model D is shown in Fig. [6] It is an example for a model in 
which m* for satellites does not strongly depend on i\finfaii- In gen- 
eral, the slope of the — A/infaU relation, and the location where 
it turns over can be related to various processes in galaxy formation 
physics. Merger-rates, cooling, feedback and star-formation should 
all be com bined in order to repro duce the observed m* — MinfaU re- 
lation (e.g. IShankar et al]|2006l) . Unless our more extreme models 
like model D are ruled out by other observational constraints, our 
results indicate a large amount of freedom in modeling the above 
processes. 

The stellar fractions, m*/Afinfaih for the few models pre- 
sented here, are plotted in Fig.|9] The halo mass at which the global 
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Figure 5. Same as Figs. [2] & [H but for model C. Here a^f = 3,p 
0.5, g = 0.8. 



Log Minfall 

Figure 6. Same as Figs. |2]&[l] but for model D. Here cx^i = 3, and the 
location of unresolved subhaloes is set by the most bound particle. 



stellar fraction reaches a maximum can range from ~ 3 x 10 
to ~ 3 X 10^^ Mq . In model D this efficiency is approximately 
constant for satellite galaxies as a function of Afinfaii, showing no 
global strong peak. 

The CFs of all the models are shown in Fig. [TO] Models 
B,C,D fit the observed CFs to a good accuracy, below 0.1 dex 
RMS, while model A is less accurate, reaching a level of 0.2 dex 
RMS. This demonstrates the importance of using this constraint 
here, in order to narrow down the range of accepted models. As 
can be seen from Fig. [7] all the models fit the stellar mass function 
well. Another point that should be emphasized regarding Fig.llOlis 
the small scale (< IMpc) clustering of massive galaxies. The CF 
of different models shows variations at this regime, so it might be 
that extending the CF of the most massive domain into smaller radii 
will help to constrain the models. This will require a survey volume 
much larger than in the current SDSS survey. 
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Figure 7. The stellar mass functions of gala xies using models A , B, C, D 
(symbols). The observed function derived bvlLi & White! J200^ is plotted 
in dashed line. Symbols ai'e placed at the centre of each domain. 
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Figure 9. The ratio between stellar mass and A/infaii for models B, C, D. 
Satellite subhaloes are plotted as dashed lines, and are using the median 
values for a given as shown in Figs.|4]|5] and|6] Thin, medium, and thick 
lines correspond to models B, C, D respectively. All lines are artificially 
smoothed within the m* domains for a better view. The peak in the stellar 
fraction can change by more than a factor of 10 between different models, 
and between satellite and central subhaloes. 

4 RESULTS 

4.1 Search strategy 

Each model within our formalism is defined by 21 free parame- 
ters, where 18 of them define the domains in stellar masses (C/f 
& Uf from Eq.|13t used to compute the CF, 1 parameter fixes the 
time-scale for dynamical friction, and 2 parameters are responsible 
for the location of unresolved subhaloes. We were able to scan all 
possible options of the 18 parameters that govern the domains be- 
haviour (within the resolution adopted here). In addition, we have 
tested five Odf values in the range 0.1-10, and a few options for 
setting the location of unresolved subhaloes (see Appendix lAl and 
Table lAll for more details). In total, we have tested the stellar mass 
function for ~ 10^^ models, approximately lO'^ out of them were 



tested against the observed CPs. More numerical details regarding 
the search algorithm can be found in Appendix IbI 

The values of and t/j are saved numerically in fine bins. The 
choice of bin size is important for several reasons. First, the func- 
tion ^pas depends on 5 different variables, so the number of bins 
we use for each variable is limited because of computer memory 
issues. Second, fine bins will require more evaluations of CFs, as 
the code checks automatically all possible options. This can slow 
our code dramatically. On the other hand, the bin size limits the 
search resolution; too big bins will not allow us to find all possible 
solutions. As a compromise between these requirements we have 
chosen the following bin sizes: Minfaii is split into bins of 0.02 
(0.1) dex for subhaloes with mass smaller (bigger) than 10^^ Mq, 
data per r and Af200 are saved in bins of 0.1 dex. We use small 
bins for low mass Mintaii because the typical difference between 
adjacent Ui is smaller for small mass A/infaii (see e.g., model C 
above). 

As described in section l2!4l the domain boundaries are defined 
using power-law relations. Writing Eq [13] in terms of Log mass 
gives log Afinfaii = log /3i + Si log M200 . Our search algorithm 
checks all the possible values of Pi according to the bins of A/infaii. 
Si is modeled in terms of the line slope. Si = tan 9i. We sample 6i 
in steps of 6.75 degrees, over the range [—90, 45]. 

Each model is accepted if it follows the conditions below: 

• The model fits the stellar mass function of iLi & Whit3 ( l2009t) 
with an accuracy that is better than 20 per cent. This criterion is 
applied separately to each domain by integrating the stellar mass 
function over the domain range (0.5 dex in m*). We have checked 
that the range of models shown below changes in a very minor way, 
when demanding a bett er fit that resembles the statistical errors 
from iLi & Whit3 ( |2009|) (these are 5,5,5,10,20 per cent for each 
domain, in order of increasing mass). Here we choose to use a con- 
stant accuracy of 20 per cent to account for systematic uncertainties 
in the stellar mass f unction (compar i sons against other m easure- 
ment can be found in lGuo et~al ]|2010l : iBemardi et aLll2010h . 

• The model fits the logarithm of the observed CF (see Fig.llOt 
to better than an RMS value of 0.1 dex (26 per cent). This esti- 
mate is based on all points in the range 0.03 < r < 30 Mpc h^^, 
sampled in bins separated by 0.2 dex in Log(r-). In order to test 
the effect of this fit accuracy we also show results using 0.2 dex 
deviation. A detailed discussion of these issues is given below. 

• The model does not include individual points that deviate from 
the observed CF to more than a factor of 2. 

• The stellar mass function is fitted also for masses larger than 
the most massive domain (i.e. for m* > 10^^ '''' A^q). This domain 
should include only 23 galaxies, when using the full Millennium 
simulation. Since this is a very small number, we allow our mod- 
els to deviate an additional Poisson error from the nominal value. 
This means that the number of galaxies with masses larger than 
lO^^'^'^ Mq can have any value in the range [14, 34]. 

• The stellar fraction (i.e. m*/A/infaii) does not exceed the uni- 
versal fraction of 0.17 for all subhaloes in the sample. 

We adopt an accuracy of 0. 1 and 0.2 dex for fitting the CPs due 
to the following reasons. First, the Millennium simulation being 
used here is based on a cosmological model with as =0.9, higher 
than the most updated measurements of erg = 0.8 jjarosik et al.l 
I20T J). This should give rise to some deviation between our models 
and observations. We find that there is a minimum deviation of 0.08 
dex between all the models discussed here, and the observed CPs. 
Since this deviation is dominated by scales larger than 1 Mpc, it is 
probably related to the different cosmological model assumed. Sec- 
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Figure 11. A summary of all the models that fit the observed stellar mass 
function and the auto correlation functions of galaxies at z = 0. Lines 
show the envelope of all relations between and Mj^f^n (we use median 
values of A/infall psr ^ given m* for each model). The results are separated 
into central and satellite subhaloes. Thin dotted-dashed line coiTesponds to 
the median A/infill for all satellite galaxies residing in haloes more massive 
than 10^^ Af . The models reproduce the observed stellar mass function to 
better than 20 per cent, and the CFs to better than 0.1 dex RMS. The thin 
solid line shows the — Afinf^ii relation using the same behaviour for 
satellite and central galaxies, with no dependence on M200 ■ 

ond, due to the finite bins that are used for saving V', the accuracy 
of our search algorithm is approximately 0.03 dex. As a result of 
the above, our minimum fit criterion is chosen to be 0.1 dex. How- 
ever, since the error bars presented for the observed CFs are only 
statistical, it might be that systematic uncertainties would introduce 
errors that are larger than 0. 1 dex. The CFs depend strongly on var- 
ious galaxy p roperties like color, star-formation, and morphology 
jLi et alJboOfil) . Consequently, if the stellar masses of galaxies are 
systematically biased for galaxies of a given property (for example 
the high star-forming galaxies), this could change the CFs consid- 
erably. We therefore additionally consider a fitting criterion of 0.2 
dex, to demonstrate the effect of the possible systematic uncertain- 
ties. 

The minimum subhalo mass (~ 6.3 X 10^° A/q), enforced 
by the Millennium simulation seems to limit our models (see mod- 
els A and C above). It might be that using simulations of higher 
resolution will permit more models. The Millennium II simulation 
( iBovlan-Kolchin et aljlioosl) is a good candidate for such a study. 
However, high resolution simulations are naturally based on much 
smaller volume than what is being used here, resulting in a smaller 
statistical sample and non-negligible cosmic variance effects. Since 
our study is aiming at fitting the CFs to a high accuracy of 0. 1 dex, 
we focus our study only on the Millennium simulation. 

4.2 The mass relation 

The main results of the parameter search are summarized in 
Figs. [TT] and [T2] In case we force the models to reproduce the ob- 
served CFs to a high accuracy of 0.1 dex, the range of accepted 
models occupies a region of ~1 dex in the m* — Afinfaii plane, as 
shown in Fig.[TT] Interestingly, the uncertainty is small for central 
massive subhaloes. On the other hand, low mass central subhaloes 
can host galaxies with very low stellar masses. The range of ac- 



Figure 12. Similar to Fig. Illl but allowing the models to deviate from the 
observed CFs by up to 0.2 dex RMS. Results from weak tensing analysis by 
iMandelbaum et ai] )2006h are plotted in symbols. Squares and circles refer 
to the mean value of early and late type galaxies respectively, with error bars 
that reflect 95 per cent confidence level. The fraction of late-type galaxies 
out of the full sample is 0.74, 0.60, 0.46, 0.32, 0.20, 0.1 1, 0.05 (ordered in 
increasing m*). 



cepted models increases significantly in Fig. [12] where the accuracy 
of fitting the CFs is set to 0.2 dex. We have checked that the dis- 
tributions of models within the plotted envelope in Figs. ll lll2l are 
roughly uniform, so the range of models is not affected by outliers. 

The number of models presented in Fig. [TT] and [12] is quite 
large. Each domain includes ~ 10'' successful models in Fig. [TTI 
and ~ 10® models in Fig. [12] We only take into account domains 
for which nearby Ui from adjacent domains coincide, and the set 
of all 5 domains covers the full mass range. Due to the above, we 
compute the median A/infaii for a given m*, which does not de- 
mand combinations of domains of different m*. Computing the 
opposite relation, median m* per a given Afinfaii, is much more 
complicated within our formalism. However, since the relation for 
central galaxies does not include any scatter, it represents a median 
m* per a given Afinfaii as well. 

In our models, the stellar mass of satellite galaxies depends on 
their host halo mass, which correlates with the number of neigh- 
boring galaxies in observational studies. As a result, the range in 
the m* — A/infaii relation changes as a function of the host halo 
mass. This is shown in Fig.[TT]by plotting the envelope of all me- 
dian relation in the m* — Afinfaii plane, but taking into account only 
galaxies that reside in clusters (their host halo mass is bigger than 
10^^ Mq). For these satellite galaxies, the range in A/infaii for a 
given is larger than the one for all galaxies. 

Fig. 1121 includes the results of weak lensing analysis from 
IMandelbaum et alj feOOd) . These estimates are for the host halo 
mas central galaxies, and should be compared to the solid lines 
of our models. The square symbols represent a subsample of early 
type galaxies, while the circles correspond to late-type galaxies. As 



^ IMandelbaum et all )2006l) define the halo mass as the mass enclosed 
within a radius that coiTespo nds to 180 times the m ean density, this mass 
is higher than Af200 (see also lWeinmann et alj200d) . but agrees quite well 
(< 0.1 dex) with A/infaii for central subhaloes used here. 
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is mentioned in the figure caption, the fraction of late-type galaxies 
is very small at the most massive stellar mass bin (0.05). 

Surprisingly, our results deviate from those of 
iMandelbaum et al.l ( |2006|) at the high mass end, even when 
the range of models is large, using an uncertainty of 0.2 dex in 
matching the CF. This is simi lar to what was found by previous 
ABM studies, summarized bv iBehroozi et al.l ( l2010h . Apparently, 
for a given value of m*, the weak lensing results constrain the 
host halo mass by providing mainly an upper threshold (the low 
values of the host halo often reach the universal baryonic fraction). 
The opposite seems to be true for central galaxies in our analysis. 
Deviations between the two studies can be due to various effects: 



• IMandelbaum et af] j2006l) use a specific halo model for com- 
puting the lensing signal, which differs from the set of models being 
used here. 

• There is some uncertainty in their study owing to the width of 
the Minfaii-distribution, for a given m*. 

• T heir stellar mass estimat es are based on [Xauffmann et al.l 
( l2003h while the observations o f lLi& White! ( I2OO9I) use the method 
of lBlanton&Roweij | |2007|). The difference between these esti- 
mates is discussed in both lLi & White! ( !2009!) and !Guo et al] \2Q\(h . 
and its effect on the stellar mass function is probably small. How- 
ever, it might be that the difference between the methods are more 
significant for computing the CFs. 

• It might be that our range of models is too narrow, or that 
there are some systematic differences between the two approaches. 
This could be due to our search resolution, the assumed underlying 
cosmological models, or the assumed IMF. 

A better, more self-consistent way to compare our results against 
weak lensing would demand a direct estimate of the lensing signal 
from our models. This can then be compared to the observed shear 
signal. Such an a nalysis is however outsi de the scope of this work. 
A recent study bv !Leauthaud et af] J2OI ih have used a model based 
on halo occupation distribution and demonstrated the strength of 
applying this addit ional constraint. 

The results of !Mandelbaum et"al] ( !2006!) indicate that the pop- 
ulation of late-type massive central galaxies live inside haloes 
with lower masses than early-type galaxies. The difference be- 
tween early and late type galaxies indicates that the scatter in the 
m* — Mj nfaii relation might be significant. A large scatter was also 
found bv !More et al.! l !201 1!) , based on satellite kinematics. We do 
not model such a scatter for ce ntral galaxies in this study. Previous 
works (e.g. !Moster et al.!|2010l) have shown that including a scatter 
modifies the m* — Mjnfaii relation at the massive end, such that 
for a given Mjnfaii, tti* decrease with i ncreas ing scatter. Interest- 
ingly, the results of !Mandelbaum et alj l !2006!) reach the universal 
baryonic fraction, m*/Minfaii ~ 0.17, for part of the galaxy pop- 
ulation. This is very similar to what is found here, and can be seen 
in model C (Fig.|5}. 

In Fig. [T3] we plot various sub-samples of the models from 
Fig. [TT] showing the influence of different parameters on the rela- 
tion between m* and i\/intan. Our results agree with previous stud- 
ies in predicting a very tight — Afinfaii relation, once satellite 
and central galaxies are considered to have very similar median re- 
lations (at the level of ~ 0.2 dex), and M200 does not affect m*. 
This tight relation holds even though the models include the new 
ingredients related to Qdf and the location of satellite subhaloes. 
This proves that the dependence of m* on M200, and the differ- 
ence between central and satellite galaxies, are responsible for the 
range of allowed models shown in Figs. II 11121 

It is worth noting that our models do not include solutions that 
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Figure 13. The effect of different parameters on the median relation be- 
tween m* and Afinfau- Each panel shows a selection of models out of the 
full sample, as plotted in Fig. 1111 The upper left panel takes into account 
all models for which the behaviour for satellite and central subhaloes is the 
same (±0.2 dex), and no dependence on M200 is allowed. In the middle left 
panel we plot models with a similar median Minfaii for central and satellite 
galaxies, at a given m* . The lower left panel plots models for which there 
is no dependence on A/200 ■ Models using the default value of a^jf = 3 and 
location set by the most bound particle are plotted in the upper right panel. 
The middle right panel summarizes models that use the location as given by 
the most bound particle. The lower right panel summarizes all models with 
the same dynamical friction constant, ajf = 3. 



have exactly the same mass relation for central and satellite galax- 
ies, and can still fit the CFs to a level of 0. 1 dex. The minimum devi- 
ation between central and satellite galaxies in the upper left panel of 
Fig.[T3]is roughly 0.12 dex in A/intaii. This small difference might 
be an artifact of the underlying cosmological parameters assumed 
by the Millennium simulation. 

The models shown here use the two populations of galaxies 
to compensate for each other, keeping the overall — A/infaii 
relation relatively similar. This effect is shown in the middle left 
panel of Fig. [T3] Once we force the models to have a similar me- 
dian A'/infaU for satellite and central galaxies, the range of models 
decreases, especially at the high mass end. It might be that con- 
straints on the differences between m, of the two populations, for 
a given Afinfaii, could be enforced ad-hoc. However, it is not clear 
what this limit should be. A reasonable demand would be that the 
full distribution of the two populations would overlap to some level, 
allowing a transition of central galaxies into satellites. This condi- 
tion seems to be fulfilled by our models (see Figs. I4l6t . Since we 
do not model a distribution for the m* of central galaxies, we do 
not explore this issue further here. 

The effect of Odf and satellite locations is explored in Fig.ll3l 
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Figure 14. Similar to Fig. 1111 but here the CFs are constrained to fit one 
of our models, instead of the observational data. We use as a reference the 
simplest model where satellite and central galaxies have the same relation 
between and Minfall^ ™d a^f = 3. This shows that the uncertainty in 
the median relation between and Minfaii does not depend strongly on 
the reference model, and its underlying cosmological parameters. 



It seems that both elements add freedom to our models, although 
the effect of satellite location is slightly stronger. Higher resolution 
cosmological simulations might help to constrain adf,p, g. How- 
ever, these simulations are based on dark-matter only, while the 
results here are affected strongly by the baryonic component s of 
galaxies (see e.g. lBovlan-Kolchin et aDl2008l : [jiang et alj|2008h . 

Lastly, in Fig. [141 we show the results of a parameter search 
when the reference CFs are not the observed ones, but instead are 
taken from one model out of the models tested here. The range of 
accepted models in this case is related to the internal degrees of 
freedom of the models, and is less related to the choice of the cos- 
mological parameters, the method adopted for computing m*, or to 
the specific physical assumptions used in our formalism. The fact 
that Fig.[T4]is similar to Fig.[TT]indicates that the underlying cos- 
mological model does not affect the range of acceptable models sig- 
nificantly. This is because here the reference CFs are based on the 
same cosmology, but use a specific model for the relation between 
urii, and subhaloes. As was pointed out by [Cacciato et al. ( 2009), 
the effect of the underlying cosmological model should be impor- 
tant when constraints from abundance and clustering of galaxies 
are combined with galaxy-galaxy tensing measurements. This issue 
should be further examined using an appropriate A^-body cosmo- 
logical simulation. 



5 SUMMARY AND DISCUSSION 

In this work we have studied the relation between the stellar mass 
of galaxies (m^) and the mass of their host subhaloes (Afinfaii). 
Our models are constrained by the abundance of galaxies, and their 
auto-correlation function (CF) at z = 0. We have shown that once 
the population of galaxies is broken into two sub-populations of 
central and satellite galaxies, the allowed range in the m* — A/intaii 
relation for each population reaches a factor of ~ 10. The range 
of accepted models depends on the accuracy by which the models 
reproduce the observed abundance and clustering of galaxi es. As 
was demonstrated in our previous work dNeistein et al.boiol paper 



I), a different — A/infaii relation for central and satellite galaxies 
is expected in galaxy formation models, although the strength of the 
deviation cannot be constrained easily. 

The shape of the m* — Minfaii relation has a large degree 
of freedom, resulting in a stellar fraction (the ratio between m, 
and Mintaii) that can peak anywhere between ~ 3 x 10^^ and 
3 X 10^^ Mq. It can even be close to a constant as a function of 
A^infaii for satellite galaxies. Interestingly, the stellar fraction can 
reach high values, comparable to the maximum universal value of 
0.17, for low mass satellite galaxies residing in low mass haloes. 
Although it seems unlikely that the stellar fraction would exceed 
0.17, our models are not able to constrain this ratio, and instead we 
are forced to use it as an ad-hoc constraint. 

Our models are similar to the 'abundance matching' (ABM) 
approach, and are directly relating a galaxy with a specific m* to 
each subhalo within a large A*'-body simulation. However, in com- 
parison to previous studies, we include various additional ingre- 
dients for modeling satellite galaxies. Their stellar mass depends 
on both the host subhalo mass (Mjnfaii) and on the halo mass at 
z = 0, and might differ from the stellar mass of central galaxies 
of the same AfinfaU. We include subhaloes that have merged into 
more massive subhaloes at high-z but might host galaxies that will 
survive until z = 0, due to long dynamical friction time-scales. The 
location of these 'unresolved' subhaloes is set by either the location 
of their most bound particle, or by an analytical model. We found 
that the most important ingredient for fixing the range of accepted 
models is the way stellar masses for satellite galaxies are modeled. 

A model with two populations of galaxies includes a large 
amount of freedom in matching the observed stellar mass function. 
We therefore constrain our models also against the observed auto- 
coiTelation function (CF) of galaxies. We developed a new formal- 
ism to compute the model CF based on the pair statistics of sub- 
haloes. This enabled us to scan systematically a significant part of 
the parameter space. We have tested 10^^ different ABM mod- 
els, and estimated the CF for ~ 10^ models that showed a good 
match to the stellar mass function. The accuracy by which we fit 
the CF affects significantly the range of accepted models within the 
m* — Minfall plane. Consequently, a detailed study of the system- 
atic uncertainties involved in this measurement is crucial in order 
to better constrain our models. 

The range allowed in the — Afinfaii relation might be larger 
than our prediction due to various effects. There are still a large 
number of models that we did not test here. These include com- 
plicated dependence of on the halo mass, different estimates 
for dynamical friction and satellite locations, higher resolution of 
the underlyin g A'^-body simulation , different observed stellar mass 
functions (e.g. lBemardi et al.l2010l) . and modeling a random scatter 
in m* . On the other hand, the range of models might be modified 
once we use subhaloes from an A^'-body simulation using the most 
up-to-date cosmological parameters. Previous estimates for the un- 
certainty in the m* — Afinfaii relation were emphasizing the con- 
tribution from uncertainty in the stellar mass function, resulting in 
0.25 dex (Behroozi et al. 201(3). This uncertainty should be added 
to what we find here. 

In this paper we have reproduced the abundance and cluster- 
ing of galaxies only at z = when constraining the m* — Afinfaii 
relation. It is not likely that data from higher redshifts would limit 
the m* — A/infaii relation to a naiTow range. This is because the 
same uncertainties discussed here would be valid at high-z, in ad- 
dition to the larger observational errors inherent at these redshifts. 
Moreover, once the m* — Afinfaii relation is fixed at some redshift 
range, it is not straightforward to decide which models violate the 
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physics of galaxy formation by linking galaxies at different epochs. 
For example, as we showed in paper I, the m* — A/infaii relation 
for satellite galaxies at the time of infall might already be different 
from the relation for central galaxies at the same epoch. 

Our empiric al results have im plic ations for various asp ects of 
galaxy formation. IGuo et aT I( l2010l) and lSawala et"ai] ( l2oT l|) argued 
that stellar fractions at a fixed subhalo mass derived from basic 
ABM models are systematically lower than the results of detailed 
hydro-simulations. Our results that allow for a much larger spread 
of stellar mass at a fixed subhalo mass, are less definitive in this 
respect. This is especially true for dwarf galaxies, for which there 
is no available clustering data. An additional implication is related 
to the cumulative energy injected into the interstellar medium in a 
supe rnova feedback-cons trained scenario. Shankar et al. (2006; see 
also iDekel & Wooll2003h re-derived the expected trend of stellar 
and halo mass of the type m* oc .fsvLv-jM^^f^n. Here 1 < q < 2 
and /surv is the fraction of surviving stellar mass. Based on the ob- 
served rather low stellar mass in haloes with Afinfaii < 10" Mq 
inferred from a basic ABM model, they argued that supernova feed- 
back appeared to be insufficient to remove the gas associated with 
the host halo. The results of this paper, however, show that some 
ABM models allow for much larger stellar mass fraction in low 
mass haloes, thus providing some hints towards the solution of this 
puzzling issue. 

Another possible non-trivial consequence of our results con- 
cem s Halo Occupation D i stribu tion (HOD) models. As outlined in 
e.g.. lBerlind & Weinber j ( |2002|) . this class of models is based on a 
parameterized conditional probability P[N\M) that a dark matter 
halo of virial mass M contains galaxies. Adding prescriptions on 
the spatial distribution of subhaloes derived from accurate A'^-body 
simulations, HOD models have been rather successful in reproduc- 
ing the two-point correlation function of different classes of galax- 
ies and at different redshifts. However, once we allow for a different 
ranking between satellite and central galaxies, as was found in this 
paper, the shape of P{N\M) might become rather different from 
its usual functional form. This could induce non-trivial degenera- 
cies in these models, allowing for different occupation distributions 
that still match the data. 

The methodology developed in this work can be used to study 
the clustering properties of AGNs, cold gas, and subsets of galax- 
ies divided by e.g. luminosity or color As was demonstrated here, 
our method already enabled us to better fit the observed stellar 
mass functions and CPs of all galaxies, with the current data sets. 
It might become more important when studying clustering prop- 
erties of objects which tend to depend more strongly on environ- 
ment, like the cold gas mass within galaxies. Our method may 
be able to shed some light on the limitations in mod eling the ob- 
serve d high-z galaxies using standard halo models jOuadri et al.l 
I2OO8I : [Tinker etal.ll2010l : IWake et al.ll201 lb . In addition, it might 
be useful for future surveys with high quality data and a large sur- 
vey volume, for which the simple abundance matching approach 
might not be flexible enough to provide an accurate fit to the data. 

Improved constraints on the m* — Afinfaii relation might 
be obtained from observational quantities, other than the stel- 
lar mass function and clustering used here. These mi ght include: 
condit ional stellar mass functi ons per halo mass (e.g. lYang et al] 
[2009?), satellite kinematics (More et al. '201 ij), weak lensing 
CMandelbaum et al. 2006; Cacciato et al. 2009), star-formation his- 
tories, and dynami cal tracers such as th e velocity dispersion and cir- 
cular velocity (e.g. lDutton et aLlboiOl) . In addition, physically mo- 
tivated models like hydrodynamical simulations or semi-analytic 
models are crucial to obtain insights on the difference between the 



evolution of satellite and central galaxies. More effort to under- 
stand the fundamental relation between the mass of subhaloes and 
galaxies is clearly needed both on the observational and theoretical 
side. 
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APPENDIX A; 
GALAXIES 



THE LOCATION OF SATELLITE 



Here we describe the model we use for fixing the location of unre- 
solved subhaloes. These subhaloes are not identified at z = 0, but 
can still host galaxies according to the dynamical friction estimate. 

Consider a massive point particle subject to dynamical friction 
in an infinite mass distribution with density profile p cx r~^. In the 
limit of a circular orbit for the point particle and an isothermal pro- 
file (7 = 2) for the background density, the decay of the satellite's 
orbit due to dynamical friction can be modeled analytically (eq. 7- 
25 of Binney & Tremaine 1987). Assuming that 7 does not differ 
much from 2, we can follow the same arguments as Binney and 
Tremaine and find that the equation governing the decay rate for 
the specific angular momentum L is 



At 



Fr 



-clnAMsat r 



vlir) 



(Al) 



where In A is the Coulomb logarithm (assumed to be constant), c is 
a constant, Vc is the circular velocity, and F is the frictional force. 
Using L — rvc, and Vc (x r 'Z^+i we get 



dL dr „^/2+i -1 
dt dt 

Integrating, we obtain 



r = rsat(l - t) 



1/(3-7/2) 



(A2) 



(A3) 



where r is the fraction of the initially estimated dynamical friction 
time-scale that has elapsed until z ~ 0, i.e. t{z = 0) — t(zsat) = 
rtdf, and Zaat is the last redshift the subhalo was identified. For 
additional flexibility, we allow r to vary as well, coiTesponding to 
the overall uncertainty in the dynamical friction time-scale: 

^p^l/'^ 



:r,at(l-r'')^ 



(A4) 



In Fig. lAll we compare the distribution of r/rsat when using 
the most bound particle, against the model summarized in Eq. lA4l 
Subhaloes from the simulation are chosen according to Odf. For 
each unresolved subhalo we compute the location of the last identi- 
fied most bound particle, this is shown in solid lines. Dashed lines 
show the results when using the model discussed above. For each 
value of Qdf plotted, we have optimized the parameters p, q to get 
the best fit. Typically, rsat is lower than 1 Mpc, and the CF mea- 
surements are reliable above ~ 10 kpc. Consequently, the differ- 
ence between the plotted lines at r/rsat < 10~^ is of less impor- 
tance to this study. We have checked that the values of p, g plotted 
here, agree with the location set by the most bound particle also 
when computing the CFs. Lastly, we list in Table IaTI the combi- 
nations of satellite locations and Odf that were used for the full 
parameter search discussed in section]?] 



APPENDIX B: SCANNING THE PARAMETER SPACE 

As explained in section |4l each model within our formalism is de- 
fined using 18 free parameters that fix the five domains in stellar 
mass used here. There are additional three parameters that define 



The stellar mass of central and satellite galaxies 1 5 




Figure Al. The cumulative distribution function (CDF) of r/r^^t. CDF 
using the location of the most bound particle are plotted in solid lines for 
0;jf = 0. 1, 3, 10 as indicated. For each value of a^f we plot in dashed line 
the best fitting model using Eq. lA4l 



(and the related integral involving ^/)) for each model, but only com- 
pute the incremental changes of this integral when going over the 
full list of models. 

• We compute CF values starting the smallest radius. After the 
CF is computed for each radius, we check whether the model fits 
the data. This allows us to quickly reject bad solutions. 

Our algorithm is able to compute CF for each domain within a 10^^ 
second, using one processor. In total, we have computed the CF for 
more than ~ 10^ models. The CFs for all models are based on the 
full sample of subhaloes from the Millennium simulation. 



Table Al. The set of parameters used for scanning the parameter space in 
section |4] 'mbp' refers to a model with satellite locations set by the most 
bound particle of the last identified subhalo. 



Satellite location (p) 


q 




mbp 




0.1,3, 8, 10 


0.1 


0.4 


0.1, 3 


0.14 


0.6 


0.1, 3, 10 


0.5 


0.8 


0.1, 1,3, 10 


1.0 


1.0 


0.1, 1 


0.17 


0.4 


10 



the dynamical friction time scale (Odf ), and the location of unre- 
solved subhaloes (p, q). Even though this seems to be a huge pa- 
rameter space, we managed to scan it systematically and with high 
resolution. In this section we explain the numerical details that al- 
low this parameter search. 

• Searching the parameter space is done separately for the dif- 
ferent domains of stellar mass. This reduces the number of inde- 
pendent parameters to those that define a domain (i.e. 6). Only in a 
later stage we search for solutions that combine a set of 5 domains 
that have mutual boundaries. 

• We search for solutions one domain after another, starting 
from the most massive one. At each domain we require that the 
model boundaries will coincide with an accepted model from the 
previous domain. 

• We first check that the stellar mass function meets the fitting 
criterion. Only after we have a solution for all the domains we 
check the fit to the CF. 

• Due to the number of bins we adopt for Afinfaii and 6i each 
domain has 10^*^ optional combinations of ai,(3i,5i. We first com- 
pute separately the contributions to the mass function (Ea.ll4t from 
satellite and central subhaloes {(f>c, 4>a)- Only in a later stage we 
combine all the 10^" options to see which of them fits the data. 

• Before computing the CF we sort the different models such 
that there will be a minimum difference between neighboring mod- 
els in the list. We then compute the CF using the 'moving average' 
scheme. This means that we do not compute the integral of Eg. 1 141 



